A graphical selection method for parametric models in noisy inhomogeneous regression
نویسندگان
چکیده
A common problem in physics is to fit regression data by a parametric class of functions, and to decide whether a certain functional form allows for a good fit of the data. Common goodness of fit methods are based on the calculation of the distribution of certain statistical quantities under the assumption that the model under consideration holds true. This proceeding bears methodological flaws, e.g. a good “fit” albeit the model is wrong might be due to over-fitting, or to the fact that the chosen statistical criterion is not powerful enough against the present particular deviation between model and true regression function. This causes particular difficulties when models with different numbers of parameters are to be compared. Therefore the number of parameters is often penalised additionally. We provide a methodology which circumvents these problems to some extent. It is based on the consideration of the error distribution of the goodness of fit criterion under a broad range of possible models and not only under the assumption that a given model holds true. We present a graphical method to decide for the most evident model from a range of parametric models of the data. The method allows to quantify statistical evidence for the model (up to some distance between model and true regression function) and not only absence of evidence against, as common goodness of fit methods do. Finally we apply our method to the problem of recovering the luminosity density of the Milky Way from a de-reddened COBE/DIRBE L-band map. We present statistical evidence for flaring of the stellar disc inside the solar circle.
منابع مشابه
Regression Modeling for Spherical Data via Non-parametric and Least Square Methods
Introduction Statistical analysis of the data on the Earth's surface was a favorite subject among many researchers. Such data can be related to animal's migration from a region to another position. Then, statistical modeling of their paths helps biological researchers to predict their movements and estimate the areas that are most likely to constitute the presence of the animals. From a geome...
متن کاملSemi-parametric Quantile Regression for Analysing Continuous Longitudinal Responses
Recently, quantile regression (QR) models are often applied for longitudinal data analysis. When the distribution of responses seems to be skew and asymmetric due to outliers and heavy-tails, QR models may work suitably. In this paper, a semi-parametric quantile regression model is developed for analysing continuous longitudinal responses. The error term's distribution is assumed to be Asymmetr...
متن کاملComparing Structure Learning Methods for RKHS Embeddings of Protein Structures
Non-parametric graphical models, embedded in reproducing kernel Hilbert spaces, provide a framework to model multi-modal and arbitrary multi-variate distributions, which are essential when modeling complex protein structures. Non-parametric belief propagation requires the structure of the graphical model to be known a priori. Currently there are nonparametric structure learning algorithms avail...
متن کاملParametric versus non - parametric modelling ? Statistical evidence based on P - value curves
In astrophysical (inverse) regression problems it is an important task to decide whether a given parametric model describes the observational data sufficiently well or whether a non-parametric modelling becomes necessary. However, in contrast to common practice this cannot be decided by solely comparing the quality of fit due to possible over-fitting by the non-parametric method. Therefore, in ...
متن کاملA Comparison between New Estimation and variable Selectiion method in Regression models by Using Simulation
In this paper some new methods whitch very recently have been introduced for parameter estimation and variable selection in regression models are reviewd. Furthermore , we simulate several models in order to evaluate the performance of these methods under diffrent situation. At last we compare the performance of these methods with that of the regular traditional variable selection methods such ...
متن کامل